Name Disambiguation Method Based on Multi-step Clustering

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Multi-stage Clustering Framework for Chinese Personal Name Disambiguation

This paper presents our systems for the participation of Chinese Personal Name Disambiguation task in the CIPSSIGHAN 2010. We submitted two different systems for this task, and both of them all achieve the best performance. This paper introduces the multi-stage clustering framework and some key techniques used in our systems, and demonstrates experimental results on evaluation data. Finally, we...

متن کامل

Clustering Technique in Multi-Document Personal Name Disambiguation

Focusing on multi-document personal name disambiguation, this paper develops an agglomerative clustering approach to resolving this problem. We start from an analysis of pointwise mutual information between feature and the ambiguous name, which brings about a novel weight computing method for feature in clustering. Then a trade-off measure between within-cluster compactness and among-cluster se...

متن کامل

An Improved Name Disambiguation Method Based on Atom Cluster

An improved name disambiguation method based on atom cluster. Aiming at the method of character-related properties of similarity based on information extraction depends on the character information, a new name disambiguation method is proposed, and improved k-means algorism for name disambiguation is proposed in this paper. The cluster analysis cluster is introduced to the name disambiguation p...

متن کامل

A Term-Based Driven Clustering Approach for Name Disambiguation

Name disambiguation in databases is a non-trivial task because people’s names are often not unique and usually only a limited information is associated with each name in the database. For example, in DBLP many authors share the same name, whereas we do not have any unique identifier to distinguish them. To make it worst, we may not always be able to access the full contents of the materials, un...

متن کامل

A Heuristic-based Hierarchical Clustering Method for Author Name Disambiguation in Digital Libraries

In this paper, we propose a heuristic-based hierarchical clustering (HHC) method to deal with the name disambiguation problem. The method successively fuses clusters of citations of compatible authors based on several heuristic and similarity measures on the components of the citations (e.g., coauthors, title of the work, publication venue). In each phase, the information of fused clusters is a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Procedia Computer Science

سال: 2016

ISSN: 1877-0509

DOI: 10.1016/j.procs.2016.04.237